NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Manifold Learning in Wasserstein Space

https://doi.org/10.1137/23M1617540

Hamm, Keaton; Moosmüller, Caroline; Schmitzer, Bernhard; Thorpe, Matthew (June 2025, SIAM Journal on Mathematical Analysis)

Free, publicly-accessible full text available June 30, 2026
Partial differential equations in data science

https://doi.org/10.1098/rsta.2024.0249

Bertozzi, Andrea L; Drenska, Nadejda; Latz, Jonas; Thorpe, Matthew (June 2025, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences)

The advent of artificial intelligence and machine learning has led to significant technological and scientific progress, but also to new challenges. Partial differential equations, usually used to model systems in the sciences, have shown to be useful tools in a variety of tasks in the data sciences, be it just as physical models to describe physical data, as more general models to replace or construct artificial neural networks, or as analytical tools to analyse stochastic processes appearing in the training of machine-learning models. This article acts as an introduction of a theme issue covering synergies and intersections of partial differential equations and data science. We briefly review some aspects of these synergies and intersections in this article and end with an editorial foreword to the issue. This article is part of the theme issue ‘Partial differential equations in data science’.
more » « less
Free, publicly-accessible full text available June 5, 2026
Expected Sliced Transport Plans

Liu, Xinran; Martín, Rocío D; Bai, Yikun; Shahbazi, Ashkan; Thorpe, Matthew; Aldroubi, Akram; Kolouri, Soheil (April 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
Rates of convergence for regression with the graph poly-Laplacian

https://doi.org/10.1007/s43670-023-00075-5

Trillos, Nicolás García; Murray, Ryan; Thorpe, Matthew (November 2023, Sampling Theory, Signal Processing, and Data Analysis)

Abstract In the (special) smoothing spline problem one considers a variational problem with a quadratic data fidelity penalty and Laplacian regularization. Higher order regularity can be obtained via replacing the Laplacian regulariser with a poly-Laplacian regulariser. The methodology is readily adapted to graphs and here we consider graph poly-Laplacian regularization in a fully supervised, non-parametric, noise corrupted, regression problem. In particular, given a dataset$$\{x_i\}_{i=1}^n$$ ${x_{i}}_{i = 1}^{n}$ and a set of noisy labels$$\{y_i\}_{i=1}^n\subset \mathbb {R}$$ ${y_{i}}_{i = 1}^{n} \subset R$ we let$$u_n{:}\{x_i\}_{i=1}^n\rightarrow \mathbb {R}$$ $u_{n} : {x_{i}}_{i = 1}^{n} \to R$ be the minimizer of an energy which consists of a data fidelity term and an appropriately scaled graph poly-Laplacian term. When$$y_i = g(x_i)+\xi _i$$ $y_{i} = g (x_{i}) + ξ_{i}$ , for iid noise$$\xi _i$$ $ξ_{i}$ , and using the geometric random graph, we identify (with high probability) the rate of convergence of$$u_n$$ $u_{n}$ togin the large data limit$$n\rightarrow \infty $$ $n \to \infty$ . Furthermore, our rate is close to the known rate of convergence in the usual smoothing spline model.
more » « less
From Graph Cuts to Isoperimetric Inequalities: Convergence Rates of Cheeger Cuts on Data Clouds

https://doi.org/10.1007/s00205-022-01770-8

García Trillos, Nicolás; Murray, Ryan; Thorpe, Matthew (April 2022, Archive for Rational Mechanics and Analysis)

Abstract In this work we study statistical properties of graph-based clustering algorithms that rely on the optimization of balanced graph cuts, the main example being the optimization of Cheeger cuts. We consider proximity graphs built from data sampled from an underlying distribution supported on a generic smooth compact manifold$${\mathcal {M}}$$ $M$ . In this setting, we obtain high probability convergence rates for both the Cheeger constant and the associated Cheeger cuts towards their continuum counterparts. The key technical tools are careful estimates of interpolation operators which lift empirical Cheeger cuts to the continuum, as well as continuum stability estimates for isoperimetric problems. To the best of our knowledge the quantitative estimates obtained here are the first of their kind.
more » « less
GRAND++: Graph Neural Diffusion with A Source Term

Thorpe, Matthew; Nguyen, Tan Minh; Xia, Heidi; Strohmer, Thomas; Bertozzi, Andrea; Osher, Stanley; Wang, Bao (April 2022, International Conference on Learning Representation (ICLR))

We propose GRAph Neural Diffusion with a source term (GRAND++) for graph deep learning with a limited number of labeled nodes, i.e., low-labeling rate. GRAND++ is a class of continuous-depth graph deep learning architectures whose theoretical underpinning is the diffusion process on graphs with a source term. The source term guarantees two interesting theoretical properties of GRAND++: (i) the representation of graph nodes, under the dynamics of GRAND++, will not converge to a constant vector over all nodes even as the time goes to infinity, which mitigates the over-smoothing issue of graph neural networks and enables graph learning in very deep architectures. (ii) GRAND++ can provide accurate classification even when the model is trained with a very limited number of labeled training data. We experimentally verify the above two advantages on various graph deep learning benchmark tasks, showing a significant improvement over many existing graph neural networks.
more » « less
Full Text Available
GRAND++: Graph Neural Diffusion with A Source Term

Thorpe, Matthew; Xia, Hedi; Nguyen, Tan; Strohmer, Thomas; Bertozzi, Andrea; Osher, Stanley; Wang, Bao. (January 2022, International Conference on Learning Representations)

We propose GRAph Neural Diffusion with a source term (GRAND++) for graph deep learning with a limited number of labeled nodes, i.e., low-labeling rate. GRAND++ is a class of continuous-depth graph deep learning architectures whose theoretical underpinning is the diffusion process on graphs with a source term. The source term guarantees two interesting theoretical properties of GRAND++: (i) the representation of graph nodes, under the dynamics of GRAND++, will not converge to a constant vector over all nodes even as the time goes to infinity, which mitigates the over-smoothing issue of graph neural networks and enables graph learning in very deep architectures. (ii) GRAND++ can provide accurate classification even when the model is trained with a very limited number of labeled training data. We experimentally verify the above two advantages on various graph deep learning benchmark tasks, showing a significant improvement over many existing graph neural networks.
more » « less
Full Text Available
GRAND++: Graph Neural Diffusion with A Source Term

Thorpe, Matthew; Nguyen, Tan; Xia, Hedi; Strohmer, Thomas; Bertozzi, Andrea; Osher, Stanley; Wang, Bao (January 2022, ICLR)

Full Text Available
Large data and zero noise limits of graph-based semi-supervised learning algorithms

https://doi.org/10.1016/j.acha.2019.03.005

Dunlop, Matthew M.; Slepčev, Dejan; Stuart, Andrew M.; Thorpe, Matthew (September 2020, Applied and Computational Harmonic Analysis)

Full Text Available
Poisson Learning: Graph Based Semi-Supervised Learning At Very Low Label Rates

Calder, Jeff; Cook, Brendan; Thorpe, Matthew; Slepcev, Dejan (January 2020, Proceedings of the 37th International Conference on Machine Learning, PMLR)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records